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Proposal  Title:  Repair  Machinery  for  Radiation-Induced  DNA  Damage 

Principal  Investigator:  David  M.  Wilson  III,  Ph.D. 

Biology  &  Biotechnology  Research  Program,  L-441 
Lawrence  Livermore  National  Laboratory 
Regents  of  the  University  of  California 
7000  East  Avenue 
Livermore,  CA  94551-0808 

Office:  (925)423  0695 
Fax:  (925)422  2282 

INTRODUCTION 

Current  methods  to  treat  breast  cancer  and  preserve  normal  tissue  involve  total  mastectomy 
followed  by  radiotherapy.  The  use  of  postoperative  radiotherapy  however  has  produced  mixed 
results.  While  some  randomized  studies  indicate  that  there  is  no  major  benefit  from 
postmastectomy  radiation  treatment,  there  exists  a  subgroup  of  patients  that  benefit  from  this  type 
of  strategy.  Thus,  it  is  imperative  that  we  design  effective  prognostic  tools  to  identify  patients  that 
will  profit,  as  well  as  those  that  may  be  harmed,  by  adjuvant  radiotherapy.  Radiation  is  cytotoxic  to 
cells  in  part  because  it  introduces  lethal  genetic  damage.  Thus,  understanding  the  repair 
mechanisms  for  radiation-induced  DNA  alterations  and  having  prior  knowledge  of  a  patient’s 
radiation-specific  repair  capacity  will  help  design  more  effective  treatment  regimes  and  help 
determine  which  patients  will  be  most  responsive  to  radiation  exposure.  We  intend  to  identify  and 
characterize  factors  involved  in  correcting  common  radiation-induced  damages.  Such  studies  will 
provide  the  necessary  tools  (both  specific  DNA  repair  genes  and  biochemical  assays)  for 
predicting  an  individual’s  repair  capacity  and  thus  potential  radiation  sensitivity  (genetic 
predisposition). 

BODY 


Objectives: 

1.  To  construct  an  APE1  knockout  cell  line  (months  1-24)  and  examine  the  role  of  Apel  in  DNA 
repair  (months  25-36). 

Radiation  induces  an  array  of  DNA  damages,  including  abasic  lesions  and  strand  breaks 
harboring  3’-blocking  termini  such  as  phosphoglycolate  and  phosphate  groups  (1,2).  The  human 
Apel  protein  has  been  shown  to  incise  at  AP  sites  and  remove  a  subset  of  3’-damages,  as  well  as 
to  stimulate  the  DNA-binding  activity  of  several  oncoproteins  (e.g.  p53,  Fos  and  Jun)  in  vitro 
(3, 4, 5, 6).  Yet  despite  the  basic  understanding  of  the  biochemical  properties  of  Apel,  the  in  vivo 
function  of  the  protein  remains  largely  unclear,  particularly  as  it  relates  to  radiation  protection. 

APE1  knockout  cell  lines  represent  an  essential  step  towards  defining  the  biological  contribution  of 
this  mammalian  protein. 


5 


We  predict  that  Ape  1 -deficient  cells  will  demonstrate  a  quantitatively  significant  defect  in  the 
repair  of  certain  DNA  damages,  but  of  which  damages  needs  to  be  determined.  We  have  selected 
Chinese  Hamster  Ovary  (CHO)  as  the  target  cell  line  for  knockout  generation,  since  they  present 
the  many  advantages  described  in  the  original  proposal.  Towards  generating  Apel -null  cells,  we 
first  determined  the  APE1  gene  copy  number  in  the  CHO  AA8  cell  line.  Fluorescence  in  situ 
hybridization  with  an  APE1  bacterial  artificial  chromosome  (BAC)  genomic  fragment,  which  we 
isolated  from  a  CHO  BAC  library,  revealed  that  APE1  exists  as  a  single  copy  gene  in  AA8  (see 
Appendix,  Figure  1);  thus  only  a  single  targeting  event  is  needed  to  generate  a  knockout  line.  As 
noted  in  the  original  proposal,  we  had  previously  obtained  an  APE1  CHO  genomic  clone  spanning 
roughly  13.5  kb  (graciously  provided  by  Pablo  Arenas,  UTEP).  This  fragment  has  been  partially 
sequenced  and  used  to  construct  the  targeting  vectors,  which  are  designed  to  delete  a  portion  of 
the  Apel  C-terminal  domain  known  to  be  essential  for  nuclease  activity.  In  our  first  screen  of  360 
colonies  for  an  APE1  knockout,  we  were  unsuccessful  in  identifying  a  targeted  deletion  event.  This 
result  prompted  us  to  consider  the  possibility  that  APE1  null  cell  lines  are  inviable.  Thus,  to  prevent 
future  complications  of  this  sort,  we  have  stably  integrated  a  Tetracycline  (Tc)-regulated  human 
APE1  cDNA  into  the  AA8  genome.  Of  roughly  32  selected  stably-integrated  clones,  twelve  were 
found  to  overexpress  the  human  Apel  protein  at  significant  levels.  Of  these,  one  (termed  AA8 
human  Apel -3)  was  selected  based  on  the  criteria  that  it  expresses  Apel  at  the  highest  level  in  the 
absence  of  Tc  (which  is  used  to  shut  down  human  APE1  cDNA  expression),  and  is  most 
sufficiently  turned  off  in  the  presence  of  Tc  (see  Appendix,  Figure  2).  When  compared  to  AA8  cells 
alone  (i.e.  cells  not  harboring  the  Tc-regulated  human  APE1  cDNA),  the  AP  endonuclease  activity  of 
the  AA8  human  Apel -3  cell  line  is  5  to  8-fold  higher  in  the  absence  of  Tc;  notably,  this  higher 
expression  is  more  similar  to  what  is  observed  in  human  HeLa  cell  extracts  (see  Appendix,  Table 
1).  Importantly,  the  AA8  human  Apel -3  cell  line  exhibits  stable  human  Apel  protein  expression  for 
at  least  6  weeks.  Using  this  line,  we  performed  another  targeting  screen  with  the  original  knockout 
vector,  and  again  were  unable  to  detect  any  targeted  events  in  >700  independent  clones.  We  have 
since  designed  a  new  targeting  construct,  assuming  that  the  initial  construct  was  problematic,  and 
are  currently  performing  analysis  of  newly  transfected  AA8  human  Apel -3  cells.  The  objective, 
once  we  have  obtained  an  APEI-deleted  AA8  human  Apel-3  cell  line,  is  to  add  Tc  to  the  medium, 
shutting  off  human  Apel  protein  expression,  and  to  monitor  cell  growth  in  the  absence  of 
significant  abasic  endonuclease  activity.  If  such  a  function  is  essential  for  viability,  cells  will  be 
unable  to  survive  in  the  presence  of  Tc.  As  described  in  the  original  proposal,  if  cells  do  survive,  we 
will  examine  for  sensitivity  of  these  abasic  endonuclease-deficient  cells  to  several  DNA-damaging 
agents. 

As  a  fortuitous  product  of  this  research,  the  AA8  human  Apel-3  cell  line  presents  a  unique 
tool  to  examine  the  role  of  Apel  in  the  repair  of  various  DNA  damages  as  a  rate-limiting  factor.  That 
is  to  say,  since  AA8  maintains  roughly  5  to  8-fold  less  abasic  endonuclease  repair  activity  (see 
Appendix,  Table  1),  if  this  activity  is  rate-limiting  in  the  repair  of  specific  forms  of  DNA  damage,  then 
the  AA8  human  Apel-3  cell  line  will  exhibit  improved  resistance.  Alternatively,  the  higher  level  of 
Apel  protein  expression  may  sensitize  cells  to  certain  DNA-damaging  agents,  by  generating 
incised  single-strand  break  intermediates  that  cannot  be  processed  efficiently  by  subsequent 
pathway  repair  factors.  Thus,  results  from  these  studies  will  test  the  feasibility  of  regulating  Apel 
protein  levels  as  a  means  of  protecting  or  sensitizing  target  cells  from  the  effects  of  anti-cancer 
agents  such  as  ionizing  radiation,  bleomycin  and  the  many  alkylating  compounds.  These 
experiments  present  a  new  line  of  investigation  not  directly  proposed  in  the  original  grant. 
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2.  To  characterize  hYjeR  and  hXPMC2/Hem45  as  3’  to  5’  exonucieases  and  3’-repair  enzymes 
(months  1-36). 

Apel  has  been  shown  to  possess  a  poor  3’-repair  activity  on  certain  DNA  substrates  (3), 
thus  alternative  repair  mechanisms  presumably  exist  for  such  damages.  We  will  search  for 
additional  3’-damage  processing  activities,  testing  both  the  APE1  knockout  cells  for  this  function  as 
well  as  several  recently  identified  human  candidate  3’  to  5’  exonucleases. 

Previous  computer-based  studies  identified  three  proteins,  XPMC2,  ISG20/Hem45  and  YjeR, 
which  contain  the  structurally-conserved  exonuclease  motifs  of  the  3'-5'  exonuclease  superfamily 
(7,8).  Using  traditional  protein  expression  and  purification  methods,  we  have  purified  the  human 
YjeR  and  ISG20/Hem45  proteins  to  near  homogeneity.  Substrate  analysis  has  revealed  that  the 
human  YjeR  protein  exhibits  nuclease  activity  on  primarily  nucleic  acid  substrates  of  <5  nucleotides 
in  length  (see  Appendix,  Nguyen  et  al.,  in  press).  Thus,  we  have  named  the  human  protein  Sfn,  for 
small  fragment  nuclease.  Such  a  substrate  specificity  indicates  that  Sfn  is  unlikely  involved  in 
repairing  genetic  damage  on  large  chromosomal  DNA,  yet  presumably  plays  a  role  in  recycling 
short  DNA  fragments,  such  as  those  generated  during  repair  excision  events.  We  will  not  pursue 
further  characterization  of  Sfn,  since  (given  its  substrate  preference)  this  protein  is  unlikely  to 
contribute  significantly  to  ionizing  radiation  resistance. 

Substrate  analysis  with  ISG20/Hem45  found  that  this  human  protein  degrades  both  RNA 
and  DNA  substrates,  with  DNA  at  roughly  50-fold  less  efficiency  (see  Appendix,  Nguyen  et  al.,  in 
preparation).  Furthermore,  substrate  specificity  was  selective  for  single-stranded  nucleic  acid 
fragments.  These  results  suggest  that  this  protein,  like  Sfn,  is  unlikely  involved  in  resistance  to 
ionizing  radiation-induced  DNA  damage.  Thus,  we  are  not  planning  to  pursue  further 
characterization  of  ISG20/Hem45.  As  Objective  2  of  the  original  proposal  (i.e.  to  characterized  Sfn 
and  ISG20/Hem45  biochemically)  will  be  completed  in  the  next  several  months  (much  quicker  than 
anticipated),  we  will  instead  focus  our  future  efforts  on  constructing  the  APE1  knockout  cells 
(Objective  1),  characterizing  the  AA8  human  Apel -3  cell  line,  and  identifying  yet  other  alternative  3’- 
repair  proteins  or  mechanisms  if  appropriate. 

KEY  RESEARCH  ACCOMPLISHMENTS 

•  Completed  characterization  of  the  human  YjeR  equivalent,  which  we  have  termed  Sfn  for  small 
fragment  nuclease.  This  data  will  appear  in  an  upcoming  issue  of  Journal  of  Biological 
Chemistry  (see  Appendix  for  included  pre-print). 

•  Completed  characterization  of  the  human  Hem45  protein.  Results  from  these  studies  are 
currently  being  organized  for  submission  (see  Appendix  for  included  rough  draft). 

•  Established  an  Apel -overexpressing  mammalian  cell  line  (AA8  human  Apel -3)  that  will  permit 
analysis  of  the  role  of  this  protein  in  DNA  damaging  agent  resistance,  particularly  as  a  rate- 
limiting  factor.  This  cell  line  permits  regulation  of  Apel  protein  expression  and  is  currently 
being  used  in  the  knockout  targeting  experiments  in  the  event  that  a  null  cell  is  lethal. 

•  Have  completed  the  biochemical  characterization  of  the  abasic  endonuclease  activities  of  the 
overexpressing  cell  line  (AA8  human  Apel -3)  relative  to  cell  line  control  extracts. 

•  Constructed  two  targeting  constructs  for  knocking  out  the  APE1  gene  in  CHO  cells.  The  first 
construct  has  been  abandoned  after  screening  over  1000  colonies  for  a  targeting  event  without 
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success.  Current  screening  is  ongoing  after  transfection  of  the  second  targeting  construct  into 
the  AA8  human  Apel-3  cell  line. 

REPORTABLE  OUTCOMES 

•  Nguyen,  L.H.,  J.P.  Erzberger,  J.  Root  and  D.M.  Wilson  III.  (2000)  The  human  homolog  of  E.  coli 
Orn  degrades  small  single  stranded  RNA  and  DNA  oligomers.  J.  Biol.  Chem.  In  Press. 

•  Nguyen,  L.H.  and  D.M.  Wilson  III.  (2000)  Human  Hem45/lsg20  degrades  single-stranded  RNA 
and  DNA  in  vitro.  In  preparation. 

•  Development  of  the  AA8  human  Apel-3  cell  line. 

CONCLUSIONS 

Understanding  the  repair  mechanisms  for  ionizing  radiation  (IR)-induced  DNA  damage  and 
having  prior  knowledge  of  a  patient’s  radiation-specific  repair  capacity  will  help  determine  which 
patients  will  be  most  responsive  to  radiation  therapy  and  design  more  effective  treatment  regimes. 
The  objective  of  this  work  has  been  to  define  the  contributions  of  the  mammalian  protein  Apel,  and 
other  candidate  nucleases,  to  the  repair  of  IR-induced  genetic  damage.  We  are  currently 
constructing  cell  lines  that  lack  Apel  protein  and  will  determine  the  sensitivity  of  these  mutant  cells 
to  various  DNA-damaging  agents,  e.g.  IR.  In  the  course  of  these  studies,  we  generated  a 
mammalian  cell  line  that  overexpresses  Apel  roughly  5  to  8-fold.  This  line,  relative  to  the  non¬ 
overexpressing  controls,  will  be  used  to  examine  whether  Apel  operates  as  a  rate-limiting  factor  in 
the  repair  of  certain  exogenously-induced  DNA  damages  (e.g.  abasic  sites  and  3’-phosphate  and 
phosphoglycolate  blocking  damages).  Such  studies  will  address  the  potential  value  of  modulating 
Apel  repair  capacity  in  terms  of  protecting  or  sensitizing  target  cells  from  the  effects  of  IR  anti¬ 
cancer  treatments.  Additionally,  we  have  completed  the  biochemical  characterization  of  two  protein 
factors,  human  Sfn  and  Hem45/lsg20.  Results  from  these  studies  indicate  that  neither  protein  has 
a  substrate  specificity  that  is  consistent  with  a  major  role  in  repairing  IR-induced  DNA  damage.  It 
thus  appears  that  Apel  is  the  predominant  (and  perhaps  only)  repair  factor  for  3’-damages, 
although  alternative  less  prominent  corrective  mechanisms  must  exist.  Future  studies  will  focus 
more  on  obtaining  accurate  quantitative  determinations  of  the  overall  contribution  of  Apel  to  3’- 
damage  repair  and  IR  protection  and  less  on  the  search  for  alternative  3’-repair  mechanisms. 
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APPENDICES 

Figure  1 .  Fluorescence  in  situ  hybridization  analysis  of  CHO  APE1 .  Hybridization  of  genomic  DNA 
from  an  APE1 -containing  BAC  clone  with  an  AA8  metaphase  chromosome  spread  indicates  a 
single  hybridizing  region  (seen  in  green;  indicated  by  arrow).  These  results  reveal  that  APE1  is 
present  as  a  single  copy  in  CHO  AA8  DNA. 

Figure  2.  Western  blot  and  incision  activity  analysis  of  protein  extracts  from  AA8  hApel-3  cells.  AA8 
hApel-3  cells  were  either  not  exposed  (-)  or  exposed  (+)  to  0.3  ug  Tc/ml  for  0  to  96  hr.  The  cells 
were  harvested  at  the  time  indicated  and  lysed  by  sonication  to  prepare  protein  extracts.  Protein 
concentrations  were  determined  by  Bradford  assay.  A)  10  pg  of  protein  extract  was 
electrophoresed  on  12%  SDS-polyacrylamide  gel  and  then  transferred  to  nitrocellulose  and 
immunodetected  using  an  anti-human  APE1  antibody  and  chemilluminescence.  “Apel” 
represents  0.1  pg  of  purified  human  protein  as  the  positive  control.  The  immunoblot  demonstrates 
that  exposure  to  Tc  shuts  off  human  Apel  expression  (indicated  by  arrow)  to  nondetectable  levels 
by  72  and  96  hr.  B)  0.1  pg  protein  extract  was  used  in  incision  assays  as  described  by  Erzberger  & 
Wilson  (1999  J.  Mol.  Biol.  290:447-457)  with  1  pmol  of  an  18  base  pair  duplex  DNA  substrate 
harboring  an  abasic  site  in  the  center.  Specific  activity  is  reported  as  pmol  of  substrate  converted 
from  the  full-length  non-incised  18mer  fragment  to  the  incised  9mer  product  per  minute  per  mg  cell 
extract.  Tc  exposed  cells  exhibit  a  reduced  level  of  abasic  incision  activity  over  96  hr. 

Nguyen,  L.H.,  J.P.  Erzberger,  J.  Root  and  D.M.  Wilson  III.  (2000)  The  human  homolog  of  E.  coli  Orn 
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We  report  here  the  identification  of  human  homo- 
logues  to  the  essential  Escherichia  coli  Orn  protein  and 
the  related  yeast  mitochondrial  DNA-escape  pathway 
regulatory  factor  Ynt20.  The  human  proteins  appear  to 
arise  from  alternatively  spliced  transcripts,  and  are 
thus  identical,  except  the  human  Ynt20  equivalent  con¬ 
tains  an  NH2-terminal  extension  that  possesses  a  pre¬ 
dicted  mitochondrial  protease  cleavage  signal.  In  vitro 
analysis  revealed  that  the  smaller  human  protein  exhib- 
its  a  3'  to  5'  exonuclease  activity  for  small  (primarily  £5 
*'  nucleotides  in  length)  single-stranded  RNA  and  DNA 
oligomers.  We  have  named  this  human  protein  Sfh  for 
small  fragment  nuclease  to  reflect  its  broad  substrate 
range,  and  have  termed  the  longer  protein  hSfna.  Sfn 
prefers  Mn2+  as  a  metal  cofactor  and  displays  a  temper¬ 
ature-resistant  (to  50  °C)  nuclease  activity.  Kinetic  anal¬ 
ysis  indicates  that  Sfn  exhibits  a  similar  affinity  for 
small  RNAs  and  DNAs  (Km  of  ~1.5  /am),  but  degrades 
small  RNAs  ~4-fold  more  efficiently  than  DNA.  Mutation 
of  a  conserved  aspartate  (Asp136)  to  alanine  abolishes 
both  nuclease  activities  of  Sfh.  Northern  blot  analysis 
revealed  that  a  1-kilobase  transcript  corresponding  to 
SFN  and/or  SFNa  (these  mRNAs  differ  by  only  two  nu¬ 
cleotides)  is  expressed  at  varying  levels  in  all  fetal  and 
adult  human  tissues  examined.  Expressed  tag  sequence 
clone  analysis  found  that  the  two  splice  variants,  SFN  to 
SFNu,  are  present  at  a  ratio  of  roughly  4  to  1,  respec¬ 
tively.  The  results  presented  within  suggest  a  role  for 
AQ:  B  human  Sfn  in  cellular  nucleotide  recycling. 


Nucleases  are  critical  components  of  DNA  and  RNA  metab¬ 
olism,  carrying  out  functions  in  DNA  repair,  replication  and 
recombination,  and  RNA  processing  and  degradation.  Several 
candidate  nucleases  were  recently  classified  as  members  of  the 
3'  to  5'  exonuclease  superfamily  (1).  This  superfamily  includes 
RNases  (such  as  RNase  T  and  D),  the  proofreading  domains  of 
pol  I  family  DNA  polymerases,  and  DNases  that  exist  as  inde¬ 
pendent  proteins  (such  as  Escherichia  coli  Exol)  or  as  domains 
within  larger  polypeptides  (such  as  a  region  within  the  Wern¬ 
er’s  syndrome  helicase)  (1-3).  Homology  within  the  superfam¬ 
ily  is  centered  around  three  conserved  exonuclease  motifs  (Exo 
I,  II,  and  III)  (2).  The  crystal  structure  of  the  Klenow  fragment 


*  This  work  was  supported  by  the  U.  S.  Department  of  Energy  by 

Lawrence  Livermore  National  Laboratory  under  contract  number 
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CA79056  and  U.  S.  Army  Grant  BC980514  (to  D.  M.).  The  costs  of 
publication  of  this  article  were  defrayed  in  part  by  the  payment  of  page 
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in  accordance  with  18  U.S.C.  Section  1734  solely  to  indicate  this  fact. 
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of  E.  coli  DNA  polymerase  I  and  complementing  site-directed 
mutagenesis  studies  indicate  that  the  Exo  I,  II,  and  III  motifs 
are  clustered  around  the  active  site  and  contain  four  conserved 
negatively  charged  residues  that  are  critical  for  coordinating 
the  two  metal  ions  involved  in  phosphodiester  bond  cleavage 
(reviewed  in  Ref.  4). 

Based  on  conservation  of  the  Exo  motifs,  the  protein  encoded 
by  the  E.  coli  open  reading  frame  YjeR  was  placed  into  the  3'  to 
5'  exonuclease  superfamily  (5).  This  protein  was  subsequently 
shown  in  vitro  to  be  an  exonuclease  specific  for  RNA  molecules 
shorter  than  about  5  nucleotides  (nt)1  in  length  (6).  Based  on 
this  ribonuclease  activity,  YjeR  was  renamed  Orn  for  oligori- 
bonuclease  (6).  Data  base  mining  revealed  a  human  EST  hom¬ 
olog  to  YjeR,  indicating  an  evolutionary  conserved  role  for  the 
encoded  protein  (5). 

Orn  is  one  of  eight  distinct  3'  to  5'  exoribonucleases  present 
in  E.  coli  (7).  It  is  a  processive  enzyme  that  initiates  attack  at 
a  free  3'  hydroxyl  group  on  single  stranded  RNA,  releasing 
5'-mononucleotides  in  a  sequential  manner  (6).  Notably,  unlike 
previously  characterized  E.  coli  ribonucleases,  ORN  is  an  es¬ 
sential  gene.  Experiments  using  a  temperature-sensitive  ORN 
construct  and  pulse-chase  experiments  with  radiolabeled  RNA 
revealed  an  accumulation  of  small  RNA  oligonucleotides  at  the 
non-permissive  temperature  (8).  These  observations  suggested 
a  role  for  Om  in  the  final  steps  of  mRNA  degradation,  since 
mRNA-degrading  enzymes  such  as  RNase  II  and  polynucle¬ 
otide  phosphorylase  generate  small  oligonucleotide  fragments 
that  require  further  processing  to  mononucleotides  (8). 

An  ORN  homologue  is  also  present  in  yeast  Saccharomyces 
cerevisiae  (5,  9).  This  YNT20  gene  encodes  a  larger  protein  that 
has  been  shown  to  localize  to  the  mitochondria  (9).  YNT20 
plays  a  role  in  a  S.  cerevisiae  mitochondrial  DNA  escape  path¬ 
way,  a  process  that  involves  the  transfer  of  genetic  material 
from  the  mitochondria  to  the  nucleus  (9). 

It  was  suggested,  based  on  sequence  comparisons,  that  the  S. 
cerevisiae  Ynt20  protein  is  potentially  capable  of  hydrolyzing 
both  DNA  and  RNA  (9).  Although  it  has  been  reported  that  the 
E.  coli  Orn  protein  exhibits  no  detectable  DNase  activity  on 
double  stranded  T7  chromosomal  DNA  (10),  its  ability  to  de¬ 
grade  short  deoxyribonucleotides  apparently  has  not  been  ex¬ 
amined.  We  set  out  to  examine  the  substrate  specificity  of  the 
human  Om  equivalent,  with  a  particular  interest  in  its  activity 
for  DNA  molecules,  and  demonstrate  here  that  this  protein 
cleaves  not  only  short  RNAs  but  also  short  DNAs  in  a  3'  to  5' 
direction.  Thus,  we  suggest  that  the  human  enzyme  be  named 
Sfh  for  small  fragment  nuclease.  We  discuss  our  results  as  they 
may  relate  to  the  cellular  role  of  this  protein  in  nucleic  acid 
metabolism. 


1  The  abbreviations  used  are:  nt,  nucleotide(s);  EST,  expressed  se¬ 
quence  tag;  PCR,  polymerase  chain  reaction. 
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Fig.  1.  Comparison  of  amino  acid  and  DNA  sequences  of  the  human  SFN.  A,  the  amino  acid  sequence  of  the  SFN  subfamily.  Amino  acids 
in  black  are  identical,  gray  are  conserved,  and  white  are  non-conserved.  This  alignment  was  produced  with  the  Boxshade  program.  hSFN  is  the 
Homo  sapiens  Sfn  protein  sequence  (accession  number  Q9Y3B8);  hSFNa,  H.  sapiens  Ynt20  (CAB53690);  mSFN,  M.  musculus  Sfn;  EcORN,  E.  coli 
Ora  (P39287);  CeSFN,  Caenorhabditis  elegans  Sfn  (CAA96590);  AtSFN,  Arabidopsis  thaliana  Sfn  (AAC77855);  SpSFN,  Schizosaccharomyces 
pombe  Sfn  (CAB37438);  and  ScYNT20,  S.  cerevisiae  Ynt20  (P54964).  The  conserved  exonuclease  motifs  of  the  3'  to  5'  exonuclease  superfamily  (Exo 
I,  II,  and  III)  are  shown  as  boxes.  The  stars  indicate  the  catalytically  important  tyrosine  and  the  four  conserved  negatively  charged  residues  that 
are  involved  metal  ion  binding  (1,  4).  The  nuclear  localization  signal  (NLS),  KKRK,  is  boxed  (18).  •  is  used  to  mark  the  putative  mitochondrial 
targeting  signal,  ARGVR,  for  hSFNa  (17).  B,  sequences  of  the  putative  splice  site  in  hSFN  (accession  number  AF151872)  and  hSFNa  (AL110239) 
mRNAs.  The  first  nucleotide  of  the  respective  protein  coding  sequence  is  designated  as  + 1.  The  first  codon  of  hSFN  is  boxed.  The  differences  in 
the  two  sequences  are  underlined. 


EXPERIMENTAL  PROCEDURES 

Identification  of  SFN  and  Sfn o  cDNAs — The  EST  data  base  was 
searched  using  the  TBLASTN  program  to  identify  human  cDNA  clones 
that  encode  a  protein  homologous  to  the  E.  coli  Om  protein  (GenBank 
accession  number  P39287).  IMAGE  clone  663460  was  chosen,  se¬ 
quenced  (11),  and  found  to  contain  an  open  reading  frame  encoding  a 
205-amino  acid  protein.  This  protein  is  identical  to  CGI-114  protein 
(accession  number  is  AF151872).  The  SFN  a  cDNA  was  found  in  the 
GenBank  data  base  (accession  number  AL110239). 

Chromosomal  Localization  of  the  Human  SFN  Gene — The  sequence 
at  the  5'  end  (accession  number  AA224296)  or  the  3'  end  (accession 
number  AA224194)  of  the  SFN  gene,  when  compared  with  the  “Se¬ 
quenced  Tagged  Site”  data  base  at  National  Cancer  Biology  Institute, 


identified  STS-H98169  as  the  genetic  locus  of  SFN.  This  sequence- 
tagged  site  is  positioned  at  chromosome  11  in  a  region  between  markers 
D11S1347  and  D11S939  (110.3-117.9  centimorgan),  which  corresponds 
to  the  region  Ilq23.1-llq23.2. 

Buffers  and  Reagents — All  reagents  were  purchased  from  Sigma 
unless  otherwise  indicated.  Restriction  enzymes  were  purchased  from 
New  England  Biolabs.  Labeled  nucleotides  were  from  Amersham  Phar¬ 
macia  Biotech.  Spectrophotometrie  grade  glycerol  was  obtained  from 
Fisher.  01io(dT)4_22  Ladder  was  purchased  from  Life  Technologies,  Inc. 
and  consists  of  single  stranded  DNA  oligos  from  4  to  22  nt  in  length, 
increasing  by  1-nucleotide  increments.  DNA  oligos  were  obtained  from 
Operon  Biotechnologies  (Alameda,  CA).  Synthetic  RNA  was  obtained 
from  Dharmacon  Research  (Boulder,  CO).  L  buffer  (Lysis  buffer)  con- 


tains  50  mM  Tris-HCl,  pH  7.9,  50  mM  KC1,  20%  glycerol,  and  1  mM 
phenylmethylsulfonyl  fluoride.  W5  buffer  is  50  mM  Tris-HCl,  pH  7.9, 
500  mM  NaCl,  20%  glycerol,  5  mM  imidazole,  and  1  mM  phenylmethyl¬ 
sulfonyl  fluoride.  W40  is  50  mM  Tris-HCl,  pH  7.9,  500  mM  NaCl,  20% 
glycerol,  40  mM  imidazole,  and  1  mM  phenylmethylsulfonyl  fluoride. 
E250  buffer  is  50  mM  Tris-HCl,  pH  7.9, 250  mM  NaCl,  20%  glycerol,  250 
mM  imidazole,  and  1  mM  phenylmethylsulfonyl  fluoride.  All  pH  values 
were  determined  at  21  °C. 

Purification  of  Recombinant  Sfn-His  Protein — The  Sfn  coding  region 
was  PCR  amplified  using  primers  NC05'HYJER  (5'-GCATGCCTG- 
GCGGCAGGGGAGAGCAT)  and  HIND3’YJER  (3’-GCAGTAAGCT- 
TACTCACGGTCTTCTCAT),  and  subcloned  after  digestion  into  the 
Ncol  and  ffindlll  sites  of  pET28d  (Novagen,  Madison,  WI)  to  generate 
phyjeR-His.  This  construct  allows  for  expression  of  six  histidine  resi¬ 
dues  on  the  carboxyl  terminus  under  the  control  of  a  T7  ENA  polym¬ 
erase  promoter.  phyjeR-His  plasmid  was  sequenced  and  no  PCR  errors 
were  found. 

The  phyjeR-His  plasmid  was  transformed  into  BL21(DE3)/pLysS  E. 
coli  strain  (Novagen).  An  overnight  culture  of  100  ml  was  grown  at 
37  °C  in  LB  (1%  bactotryptone,  0.5%  bactoyeast  extract,  and  1%  NaCl) 
with  50  pg/ml  kanamycin  and  25  p g/ml  chloramphenicol.  The  overnight 
culture  was  used  to  inoculate  1  liter  of  the  same  medium.  This  culture 
was  grown  at  37  °C  with  vigorous  aeration  until  the  A550  „  was  0.8. 
Isopropyl- l-thio-0-galactopyranoside  (final  concentration  of  1  mM)  was 
then  added  to  induce  Sfti-His  protein  expression.  At  4  h  after  induction, 
cells  were  harvested.  The  cell  paste  was  resuspended  in  30  ml  of  L 
buffer.  The  cell  suspension  was  sonicated  using  a  Misonix  XL  sonicator 
by  three  1-min  bursts  with  a  microtip  at  the  maximum  setting.  The  cell 
lysate  was  centrifuged  at  27,000  X  g  for  30  min  at  4  °C.  To  the  super¬ 
natant,  polyethyleneimine  was  added  slowly  with  constant  stirring  to  a 
final  concentration  of  0.25%  to  remove  nucleic  acids  (12).  The  suspen¬ 
sion  was  centrifuged  and  the  supernatant  was  loaded  at  1.5  ml/min  onto 
a  2-ml  S2  cation  exchange  column  equilibrated  with  L  buffer  using  the 
BioLogic  Workstation  FPLC  system  (Bio-Rad).  Fractionation  over  the 
cation  exchange  column  removed  the  contaminating  proteins  found  to 
bind  nonspecifically  to  the  subsequent  Ni  affinity  column.  The  S2  flow¬ 
through  was  collected,  and  NaCl  and  imidazole  were  added  to  final 
concentrations  of  500  and  5  mM,  respectively.  The  S2  flow-through  was 
then  incubated  with  2  ml  of  Ni-NTA  resin  (Qiagen,  Santa  Clarita,  CA) 
for  1  h  at  4  °C  with  gentle  rocking.  This  suspension  was  centrifuged, 
and  the  resin  was  washed  2  times  each  with  20  ml  of  W5  buffer,  followed 
by  4  washes  each  with  10  ml  of  W40  buffer.  Sfn-His  proteins  were  then 
eluted  from  the  Ni-NTA  resin  4  times,  each  with  2  ml  of  E250  buffer, 
dialyzed  overnight  against  L  buffer  containing  0.1  mM  dithiothreitol, 
and  stored  at  -70  °C.  Concentrations  of  protein  solutions  were  deter¬ 
mined  by  measuring  the  absorbance  at  280  nm  and  using  the  theoret¬ 
ical  molar  extinction  coefficient  E2S0  nm  of  25,940  M-1  cm-1  calculated 
for  Sfn-His  (13). 

Analytical  Gel  Filtration  Chromatography — Sfn-His  protein  was  sep¬ 
arated  on  a  7.8  X  300-mm  BioSil  SEC  125-5  gel  filtration  column  from 
Bio-Rad.  The  running  buffer  was  50  mM  Tris-HCl,  pH  7.9,  100  mM 
ammonium  sulfate,  5%  glycerol,  0.1  mM  EDTA,  and  0.1  mM  dithiothre¬ 
itol.  The  flow  rate  was  0.25  ml/min.  Size  markers  used  for  calibration 
were  thyroglobulin  (670  kDa),  y-globulin  (158  kDa),  ovalbumin  (44 
kDa),  myoglobin  (17  kDa),  and  vitamin  B-12  (1.35  kDa).  Proteins  were 
detected  by  ultraviolet  absorbance  at  280  nm. 


Site-directed  Mutagenesis  of  the  SFN  cDNA — Mutagenesis  was  per¬ 
formed  using  the  overlapped  PCR  method  as  described  (14).  The  site- 
specific  mutant  oligo  used  was  D136A,  5 '  -TATAGAATAATTGCTGT- 
GAGCACTGTTAAG-3',  with  the  mutated  codon  underlined.  We  chose 
to  mutate  Asp136  because  a  structurally  equivalent  mutation  in  E.  coli 
DNA  pol  I  (D501A)  caused  a  13,000-fold  decrease  in  3'-5'  exonuclease 
activity  (15).  The  final  SFN  mutant  PCR  product  was  digested  with 
Ncol  and  ffindlll  and  subcloned  into  the  same  restriction  sites  of 
pET28d.  This  construct  was  sequenced  as  described  (11). 

Nuclease  Assays — Various  DNA  and  RNA  substrates  were  labeled  at 
the  5'  end  by  T4  polynucleotide  kinase  with  [y-32P]ATP  (16).  RNA5  is 
5'-GAUCG-3'.  DNA5  is  5'-GATCG-3'.  DNA8  is  5'-CACGAGCC-3'.  100 
finol  of  nucleic  acid  substrate  was  incubated  with  100-2000  fmol  of 
Sih-His  protein  at  37  °C  in  10  pi  of  50  mM  K-HEPES,  pH  7.4,  10% 
glycerol,  50  mM  KC1,  10  mM  MnClj,  (unless  otherwise  noted),  0.01% 
Triton  X-100,  and  0.1  mM  dithiothreitol.  The  reactions  were  stopped 
with  10  pi  of  80%  formamide  dye  solution,  heated  at  80  °C  for  3  min, 
and  then  fractionated  on  a  20  or  a  22.5%  8  M  urea  polyacrylamide  gel. 
The  gel  was  exposed  to  x-ray  film.  Visualization  of  the  labeled  substrate 
on  the  gels  was  also  achieved  using  a  Molecular  Dynamics  (Sunnyvale, 
CA)  STORM  860  Phosphorimager  and  quantitative  analysis  was  per¬ 
formed  using  Molecular  Dynamics  ImageQuant  vl.ll  software. 

Kinetics  of  Sfn  Exonuclease  Activity  on  Single  Stranded  DNA  and 
RNA  Substrates — Nuclease  assays  were  performed  under  standard 
buffer  conditions  as  described  above  at  37  °C,  with  Sfn  at  a  final  con¬ 
centration  of  10  nM.  The  reaction  was  incubated  for  5  min  with  RNA5  or 
40  min  with  DNA5,  and  was  within  the  linear  range  of  enzymatic 
activity  (i.e.  ^15%  of  the  substrate  was  converted  to  product).  The 
substrate  concentration  range  was  0.1-3.2  pM.  The  apparent  Michaelis- 
Menten  constant  ( Km )  and  maximal  velocity  (Vmal)  were  obtained  from 
a  double-reciprocal  Lineweaver-Burk  plot.  A  linear  plot  of  1/[S]  versus 
W  (where  [SI  is  the  substrate  concentration  in  pM  and  V  is  the  velocity 
in  pM  min-1  units)  produces  a  slope  of  Krn/Vmn,  and  a  y  intercept  of 
1/V„„.  Acat  was  calculated  from  the  equation  (£„„,)(£,)  =  VmaI,  where  E, 
is  the  total  concentration  of  enzyme  in  the  assay.  Linear  regression 
analysis  was  performed  using  CricketGraph  software  (Cricket  Soft¬ 
ware,  Philadelphia,  PA). 

Northern  Blot  Analysis — Northern  blots  were  prehybridized  for  3  h 
at  65  °C  in  ExpressHyb  Solution  (CLONTECH,  Palo  Alto,  CA).  The 
cDNA  probe  (Sfn  PCR  product  described  above)  was  labeled  using  the 
Megaprime  DNA  Labeling  System  (Amersham  Pharmacia  Biotech)  and 
[a-32P]dCTP  (Amersham  Pharmacia  Biotech),  and  hybridized  in  Ex¬ 
pressHyb  Solution  at  65  °C  for  2-3  h.  Blots  were  washed  once  at  room 
temperature  for  30  min  and  twice  at  50  °C  for  30  min  with  50  mM 
NaP04,  pH  7.4,  0.5%  SDS,  and  1  mM  EDTA.  Images  were  exposed  on 
Kodak  Bio-Max  MS  film  for  16  h  and  developed.  Blots  were  normalized 
with  0-actin  transcripts. 

RESULTS 

The  Sfn  cDNA,  Genomic  Location,  and  mRNA  Expression 
Pattern — A  cDNA  containing  the  SFN  coding  region  was  ob¬ 
tained  from  the  EST  data  base  (see  “Experimental  Proce¬ 
dures”).  By  blasting  the  sequence  of  the  5'  end  (accession  num¬ 
ber  AA224296)  or  the  3'  end  (accession  number  AA224194)  of 
the  SFN  cDNA  against  the  NCBI  UniGene  data  base,  we  found 
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that  the  human  SFN  gene  mapped  to  chromosome  11  at  posi¬ 
tion  Ilq23.1-llq23.2.  Four  additional  genes  also  map  within 
this  vicinity:  apolipoprotein  A-I  ( apoA-I ),  human  serotonin  re¬ 
ceptor  3  ( HTR3 ),  human  nicotinamide  AT-methyl transferase 
( NNMT ),  and  zinc  finger  protein  ZNF259  ( PLZF ). 

The  SFN  cDNA  codes  for  a  205-amino  acid  protein  of  23,754 
daltons,  with  a  theoretical  pi  of  5.6.  Sfn  belongs  to  the  yjeRI 
ORN  family,  which  is  a  distinct  subgroup  of  the  3'  to  5'  exonu¬ 
clease  superfamily  (1,  5).  Fig.  1  shows  a  comparison  of  the 
amino  acid  sequence  of  the  Sfh-like  proteins  from  bacteria, 
yeast,  plant,  worm,  mouse,  and  human.  The  human  Sfn  protein 
is  ~50%  identical  to  its  E.  coli  counterpart,  the  Orn  protein  (5). 
Sfn  and  E.  coli  Orn  possess  the  three  characteristic  sequence 
motifs,  termed  Exo  I,  II,  and  III  (2),  of  the  3'  to  5'  exonuclease 
superfamily  (1)  (Fig.  1A).  Four  conserved  negatively  charged 
residues  within  these  three  Exo  motifs  (shown  in  Fig.  1A)  are 
involved  in  positioning  of  the  two  divalent  cations  required  for 
catalysis  and  phosphodiester  bond  cleavage  (reviewed  in  Ref. 
4). 

Upon  further  examination  of  the  GenBank  data  base,  an 
identical  protein  to  hSfn,  but  with  an  extended  NH2-terminal 
domain,  was  identified  (Fig.  1A).  This  protein,  which  we  pro¬ 
pose  to  call  Sfna,  appears  to  be  the  human  equivalent  to  the 
yeast  Ynt20  protein  and  to  have  arisen  from  an  alternative 
RNA  splicing  event  where  two  additional  nucleotides  were 
introduced  (Fig.  IB).  Using  the  PSORT  II  computer  search 
program,  a  putative  consensus  cleavage  site  motif  for  mito¬ 
chondrial  processing  proteases  (ARGVR)  (17)  was  identified 
within  the  unique  NH2-terminal  portion  of  Sfna.  Both  Sfn  and 
Sfna  contain  a  consensus  nuclear  localization  signal  (KKRK) 
(18)  (Fig.  1A).  It  seems  logical  that  Sfn  would  be  targeted  to  the 
nucleus  and  that  Sfna  would  translocate  predominantly  to  the 
mitochondria. 

Northern  blotting  revealed  that  the  Sfn  cDNA  probe  detects 
a  single  transcript  of  ~1  kilobase  in  all  fetal  and  adult  tissues 
F2  examined  (Fig.  2).  Comparatively,  relatively  low  mRNA  levels 
are  observed  in  adult  lymph  nodes,  brain,  lung,  liver,  spleen, 
and  thymus,  with  highest  levels  observed  in  heart.  Since  the 
human  SFN  and  SFNa  cDNAs  appear  to  differ  by  only  two 
nucleotides,  it  was  not  possible  to  distinguish  the  two  tran¬ 
scripts  here.  Of  the  10  human  ESTs  found  in  the  data  base,  8 
contained  a  sequence  identical  to  the  SFN  cDNA  and  two 
maintained  a  sequence  identical  to  SFNa. 

Purification  of  Overproduced  Sfn-His  Fusion  Protein — We 
have  characterized  here  Sfn,  and  expect  that  Sfna  will  exhibit 
similar  substrate  specificity  as  it  maintains  the  same  core 
nuclease  domain  (Fig.  1A).  The  SFN  gene  was  subcloned  into 
the  pET28d  plasmid  to  produce  a  COOH-terminal-tagged  Sfn- 
His  fusion  protein,  and  the  recombinant  protein  was  purified  as 
described  under  “Experimental  Procedures.”  From  1  liter  of 
induced  bacterial  culture  (about  5  g  of  wet  cell  pellet),  we 
obtained  —4  mg  of  Sfn-His  fusion  protein  of  >95%  purity  (Fig. 
F3  3A).  As  shown  in  Fig.  3B,  the  Sfn-His  fusion  protein  migrates 
as  a  single  symmetrical  peak  on  an  analytical  gel  filtration 
column  that  corresponds  to  a  globular  protein  with  a  molecular 
mass  of  ~90  kDa.  Since  the  calculated  molecular  weight  of  Sfn 
is  24,000,  Sfn-His  fusion  protein  appears  to  migrate  as  a  tet- 
ramer.  However,  since  the  elution  position  on  a  gel  filtration 
column  is  dependent  on  size  and  shape  of  the  protein,  it  is 
possible  that  the  Sfn-His  fusion  protein  exists  in  solution  as  a 
rod-shaped  protein  of  lower  oligomeric  state,  since  rod-shaped 
proteins  are  known  to  migrate  slower  than  expected  for  their 
Fn2  molecular  weight.2  Unlike  the  human  Sfn-His  protein,  the  E. 


2  Dr.  R.  Burgess,  University  of  Wiseonsin-Madison,  personal 
communication. 
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Fig.  3.  Purification  of  Sfn-His  protein.  A,  a  15%  polyacrylamide 
SDS  gel  containing  fractions  of  the  various  purification  steps.  Lane  1  is 
10  pi  of  whole  cell  extract  (30  ml);  lane  2  is  10  pi  of  whole  cell  super¬ 
natant  (30  ml);  lane  3  is  10  pi  of  the  supernatant  after  0.25%  polyeth- 
yleneimine  precipitation  (30  ml);  lane  4  is  14  pi  of  the  S2  cation  ex¬ 
change  column  flow-through  (40  ml);  lanes  5-7  are  2.4,  7.2,  and  21.6  pg 
of  Ni-affinity  purified  Sfn-His  (6  ml);  and  lane  8  is  protein  markers. 
Protein  marker  sizes  in  kDa  are  indicated.  The  gel  was  stained  with 
Coomassie  Blue  R-250.  B,  gel  filtration  column  analysis  of  Ni-purified 
Sfn-His.  The  x  axis  is  the  retention  time  (in  min)  of  marker  proteins. 
They  axis  is  log  of  molecular  weight  (log  MW)  of  proteins.  The  positions 
of  the  marker  proteins  are  indicated  on  the  calibration  curve  by  their 
molecular  weights.  The  retention  time  of  Sfii-His  is  indicated  by  the 
arrow  on  the  calibration  curve. 

coli  Orn  protein  was  reported  to  be  a  dimer  based  on  its  gel 
filtration  chromatographic  profile,  although  it  should  be  noted 
that  the  experimental  conditions  differed  slightly  from  those 
used  here  (6). 

Exonuclease  Activity  of  Sfn-His  Protein — Short  fragments  of 
£5  ribonucleotides  were  shown  to  be  the  optimal  substrate  for 
E.  coli  Orn  protein  (6,  10).  Using  a  5-nucleotide  RNA  (RNA5), 
labeled  at  the  5'  end,  as  the  substrate,  we  determined  the 
divalent  metal  cation  preference  of  the  human  Sfn.  At  metal 
concentrations  of  0.01, 0.05, 0.1, 0.5, 1, 5,  and  10  mM,  only  Mg2+ 
and  Mn2+  had  any  stimulatory  effect  on  Sfn-His  exonuclease 
activity,  with  Mg2+  being  ~  10-fold  less  effective  than  Mn2+  at 
10  mM  (Fig.  4A  and  data  not  shown).  Both  human  Sfn-His  and 
E.  coli  Orn  enzymatic  activities  have  a  strong  dependence  on 
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Fig.  4.  Metal  cofactor  requirement,  temperature  optimum, 
and  time  course  of  Sfn-His  RNase  activity.  A,  metal  preference  of 
Sfh-His.  100  finol  of  labeled  RNA5  was  incubated  with  1  pmol  of  Sfh-His 
at  37  °C  for  30  min  in  the  presence  of  10  m»  of  the  different  divalent 
metal  cations  as  indicated.  The  y  axis  is  relative  exonuclease  activity. 
Minimal  detection  for  this  assay  was  £l  finol  of  product  which  is 
equivalent  to  —2.5%  of  the  activity  detected  with  Mn2+  as  shown  for 
Cu2+,  Ca2+,  Ni2+,  and  Zn2+.  B,  temperature  optimum  of  Sfh-His.  100 
fmol  of  labeled  ENA5  was  incubated  with  500  finol  of  Sfh-His  for  3  or  10 
min  at  the  temperature  indicated.  Lane  1  is  no  protein  control.  Lanes  2 
and  3  represent  reactions  performed  at  21  °C;  lanes  4  and  5, 37  °C;  lanes 
6  and  7,  50  °C;  lanes  8  and  9,  65  °C;  and  lanes  10  and  11,  80  °C.  %P  is 
the  percentage  of  labeled  substrate  that  has  been  converted  to  the  final 
mononucleotide  product.  S  is  the  labeled  RNA5  substrate.  P  is  the  final 
product.  Shown  is  a  representative  of  three  experiments.  C,  a  time 
course  of  Sfh-His  RNase  activity  with  5  mM  MnCl2  at  37  °C.  Lane  1  is 
the  no  protein  control.  Lanes  2-5  are  0.2  pM  RNA5  incubated  with  0.02 
(i M  Sfh-His  for  various  times  (in  minutes)  as  indicated  on  top  of  the 
figure. 


(and  a  preference  for)  Mn2+  (Fig.  4A;  Ref.  10).  The  RNase 
activity  of  Sfh-His  also  has  a  temperature  optimum  around 
50  °C  (Fig.  4B),  similar  to  that  of  E.  coli  Om  protein  (10).  Thus 
both  proteins  are  quite  thermostable.  The  RNA-specific  degra¬ 
dation  activity  of  Sfn  at  37  °C  is  ~2-fold  less  than  that  at  50  °C 
(Fig.  4 B).  Since  we  observed  degraded  4-mer  products  prior  to 
detecting  mononucleotide  products  (Fig.  4 C),  Sfh-His  appears, 
as  expected,  to  be  a  3'  to  5'  exonuclease. 

To  determine  whether  short  single  stranded  DNA  can  serve 
as  substrate  for  Sfn-His,  we  used  a  single  stranded  oligo(dT) 


RNA5 


dT4-20 


DNA5 


DNA8 


Lane  1  3  5  7  9  11  13  15 

2  4  6  8  10  12  14  16 

Fig.  5.  Sfh-His  degrades  small  single  stranded  RNAs  and 
DNAs.  All  substrates  were  labeled  at  the  5'  end.  0.01  pM  of  each 
substrate  was  incubated  with  0.2  pM  Sfh-His  protein  for  1  h  at  37  °C  in 
the  presence  of  5  mM  .MnOl..  for  RNAs  and  5  mM  MgCl2  for  DNAs. 
01igo(dT)4_22  ladder  consists  of  single  stranded  DNA  oligos  from  4  to  22 
bases  long,  increasing  by  1  base  increments.  DNA5  and  DNA8  are 
single  stranded  DNA  of  5  and  8  bases  in  length.  Lanes  1, 5, 9,  and  13  are 
the  no  protein  controls.  Lanes  2,  6,  10,  and  14,  5-min  incubations  with 
Sfh-His;  lanes  3,  7,  11,  and  15,  15-min  incubations;  and  lanes  4,  8,  12, 
and  16,  45-min  incubations. 

ladder  from  4  to  22  bases  in  length,  and  single  stranded  DNA 
oligos  of  5  and  8  nt.  As  shown  in  Fig.  5,  Sfn-His  is  capable  of 
degrading  short  single  stranded  DNA,  although  with  less  effi¬ 
ciency  than  5-mer  RNAs.  Furthermore,  its  DNase  activity  is 
inversely  related  to  the  length  of  the  DNA  oligo,  as  seen  for  the 
RNase  activity  of  E.  coli  Om  (19).  The  data  above  is  also 
consistent  with  the  lack  of  DNase  activity  reported  for  Orn  on 
double  stranded  T7  chromosomal  (large)  DNA  (10). 

The  DNase  and  RNase  Activities  Are  Intrinsic  to  Sfn-His — To 
confirm  that  the  observed  DNase  and  RNase  activities  are 
intrinsic  to  the  Sfn-His  protein,  a  catalytically  inactive  Sfh-His 
mutant  was  constructed.  Amino  acid  residue  Asp136  (Fig.  1,  in 
Exo  III  motif)  was  selected  because  an  aspartate  to  alanine 
mutation  at  the  structurally  equivalent  residue  (5)  in  E.  coli 
DNA  pol  I  causes  a  13,000-fold  decrease  in  3'  to  5'  exonuclease 
activity  (14).  The  D136A  Sfn-His  mutant  was  purified  using  the 
same  procedure  as  described  for  wild  type  Sfn-His  protein  (Fig. 

6A).  When  compared  with  wild  type  Sfn-His,  D136A  Sfn-His  F6 
mutant  displayed  a  &50-fold  reduced  exonuclease  activity  for 
both  RNA5  and  DNA5  substrates  (Fig.  6B).  These  data  are 
consistent  with  both  nuclease  activities  being  intrinsic  to  the 
Sfn-His  protein,  and  not  being  the  result  of  protein  contami¬ 
nants.  The  results  also  demonstrate  the  importance  of  this 
acidic  residue  in  the  enzymatic  function  of  the  3'  to  5'  exonu¬ 
clease  family  members  (1). 

We  subsequently  examined  Sfn  nuclease  activity  on  DNA 
structures  known  to  be  products/substrates  of  DNA  repair  pro¬ 
cesses.  As  might  be  expected  from  the  results  above,  in  the 
presence  of  Mn2+  or  Mg2+,  we  did  not  detect  any  protein-de¬ 
pendent  degradation  of  42-base  pair  double-stranded  DNAs  of 
different  configurations,  including  gapped,  nicked,  4-nucleotide 
recessed  or  overhanged  3'  ends,  or  flap  structures  (data  not 
shown;  substrates  described  in  Ref.  20). 

Kinetic  Parameters  of  the  Sfn  Nuclease  Activities — To  deter- 
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Fig.  6.  The  effect  of  a  D136A  mutation  on  the  DNase  and  RNase 
activities  of  Sfn-His.  A,  purity  of  D136A  Sfh-His  mutant  as  deter¬ 
mined  on  a  15%  polyacrylamide  SDS  gel  stained  with  Coomassie  Blue 
B-250  dye.  Lanes  1  and  5  are  protein  markers;  and  lanes  2-4  are  2.5,  5, 
and  10  pg  of  Ni-affinity  purified  D136A  Sfh-His.  Protein  marker  sizes  in 
kilodaltons  are  indicated  to  the  right.  B,  exonuclease  activity  of  D136A 
Sfn-His  mutant  with  RNA5  and  DNA5  single  stranded  substrates. 
Lanes  1-9  are  with  BNA5,  and  lanes  10-18  are  with  DNA5  substrates. 
The  substrate  concentration  is  200  nM.  Lanes  1  and  10  are  the  no 
protein  control.  Lanes  2-5  and  11-14  are  12,  24,  48,  and  96  nM  of 
purified  wild  type  Sfn-His  (WT).  Lanes  6-10  and  15-18  are  12,  24,  48, 
and  96  nM  of  purified  D136A  Sfh-His  mutant.  The  reactions  were 
incubated  at  37  °C  for  20  min. 

mine  the  reason(s)  for  the  different  nuclease  efficiencies  of 
Sfh-His,  we  determined  the  Michaelis-Menten  constant  (Km), 
maximal  velocity  ( Vm ax ) ,  and  the  apparent  rate  of  catalysis 
(k^)  for  comparable  RNA  and  DNA  substrates  (Fig.  7).  The  Km 
of  RNA5  (Km  =  1.56  p.M)  is  essentially  identical  to  that  of  DNA5 
(. Km  =  1.51  pM).  In  contrast,  the  Vmax  and  kCJIt  values  were 
~4-fold  higher  for  RNA5  (Vma3t  =  0.015  pM  min-1,  =  1.5 
min-1)  than  for  DNA5  (Vmsa  =  0.004  pM  min-1,  =  0.39 
min-1).  Our  Km  value  for  RNA5  is  consistent  with  the  micro¬ 
molar  range  reported  for  E.  coli  Om  on  p(A)5  single  stranded 
RNA  substrates  (19). 


DISCUSSION 

The  human  SFN  gene  belongs  to  the  yjeR/ORN  subfamily  of 
the  3'  to  5'  exonuclease  superfamily  previously  described  (1, 5). 
Some  of  the  members  of  this  superfamily,  most  notably  the 
Werner  syndrome  gene  product  and  the  polymyositis-sclero¬ 
derma  overlap  syndrome  100-kDa  autoantigen  (PM-Scl  100), 
are  associated  with  human  disease  (21, 22).  While  a  connection 
of  SFN  to  a  specific  human  disease  is  not  obvious,  the  ubiqui¬ 
tous  expression  of  its  transcript  may  suggest  a  general  and 
essential  role  for  the  encoded  protein  in  mammalian  cells. 
Notably,  the  human  SFN  gene  maps  to  chromosome  position 
Ilq23.1-llq23.2,  a  region  that  undergoes  translocation  events 
in  several  leukemias  (23,  24),  although  none  of  these  translo¬ 
cation  breakpoints  have  been  finely  mapped  to  SFN. 


Fig.  7.  Double  reciprocal  Lineweaver-Burk  plot  of  Sfh  exonu¬ 
clease  activity  on  5-nt  single  stranded  RNA5  and  DNA5.  The  x 

axis  is  the  reciprocal  of  substrate  (1/S)  in  (pM)-1  units.  They  axis  is  the 
reciprocal  of  velocity  (1/V)  in  (pM  X  min-1)-1  units.  ■  represents  RNA5; 
•,  DNA5  substrate.  The  data  presented  here  is  the  average  of  five 
independent  experiments  for  RNA5  and  of  four  independent  experi¬ 
ments  for  DNA5  in  the  presence  of  its  preferred  metal  cation  Mn2  ’ . 
Error  bars  represent  the  standard  deviation.  Linear  regression  analysis 
showed  a  correlation  coefficient  of  0.997  for  RNA5  and  0.986  for  DNA5. 

The  unique  NH2  terminus  of  hSfnn  contains  a  consensus 
cleavage  site  pattern  for  mitochondrial  processing  proteases 
(17),  whereas  both  Sfna  and  Sfn  possess  a  nuclear  targeting 
signal  in  their  COOH-terminal  domains.  This  observation  sug¬ 
gests  that  an  alternative  splicing  event  has  evolved  to  give  rise 
to  a  mitochondrial  (Sfna)  and  a  nuclear  (Sfn)  version  of  the 
Ynt20/Orn  equivalents  in  human.  However,  at  present,  the 
genomic  DNA  sequence  of  the  entire  hSFN  gene  is  not  avail¬ 
able,  and  thus  the  alternatively  spliced  products  cannot  be 
confirmed.  Notably,  in  the  mouse  EST  data  base,  there  are  also 
multiple  transcripts,  suggesting  that  alternatively  spliced  mu¬ 
rine  SFN  transcripts  exist  as  well.  Future  studies  will  need  to 
address  whether  there  is  tissue-specific  expression  of  the 
mRNA  splice  variants  found  in  mammals.  Interestingly,  while 
S.  cerevisiae  maintains  a  homologue  to  hSFNa  (called  YNT20 
or  Rex2),  we  were  unable  to  find  a  homologue  to  SFN  in  the 
NCBI  data  base  using  the  Blastp  search  program. 

A  knockout  of  the  E.  coli  SFN  homologue,  the  ORN  gene, 
results  in  cellular  lethality  (7).  An  E.  coli  temperature-sensi¬ 
tive  mutant  is  not  only  lethal  at  the  nonpermissive  tempera¬ 
ture  (i.e.  where  Orn  is  inactive),  but  accumulates  small  oligori- 
bonucleotides,  indicating  that  Om  maintains  an  essential 
activity  to  degrade  RNA  (8).  Three  possibilities  were  provided 
as  to  why  ORN  deletion  mutants  are  inviable  (8):  1)  accumu¬ 
lation  of  oligoribonucleotides  results  in  a  depletion  of  cellular 
mononucleotides;  2)  accumulated  oligoribonucleotides  inhibit 
certain  enzymes  and  interfere  with  essential  metabolic  pro¬ 
cesses;  or  3)  Orn  has  an  additional  unknown  function  that  is 
responsible  for  the  growth  cessation.  Since  Sfh  is  capable  of 
degrading  both  RNA  and  DNA,  we  propose  that  the  human 
protein  operates  to  remove  not  only  short  RNAs  (likely  result¬ 
ing  from  RNA  degradation  processes)  (25,  26),  but  also  short 
single  stranded  DNAs  that  might  arise  as  products  of  DNA 
repair  and  recombination.  Thus,  by  extension  (8),  Sfn  would 
function  globally  to  recycle  nucleotides  or  to  remove  nucleic 
acids  that  may  interfere  with  essential  cellular  processes.  How 
the  nuclease  activities  of  the  Orn/Ynt20  proteins  would  func¬ 
tion  in  mitochondrial  DNA  escape,  a  process  that  involves 
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translocation  of  mitochondrial  DNA  to  the  nucleus  (9)  and  is 
possibly  linked  to  cellular  aging  or  senescence  (27),  is  presently 
unclear.  Furthermore,  although  unlikely,  whether  the  NH2- 
terminal  differences  between  these  proteins  affect  substrate 
specificity  will  need  to  be  determined. 

Not  surprisingly,  Sfn-His,  which  shares  ~50%  identity  in 
amino  acid  sequence  to  its  bacterial  counterpart,  has  very 
similar  biochemical  properties  to  Orn  (5).  In  particular,  both 
proteins  degrade  short  RNAs  in  a  3'  to  5'  direction,  prefer 
Mn2+  as  its  metal  cofactor,  and  have  an  enzymatic  activity 
temperature  optimum  of  ~50  °C.  However,  gel  filtration  chro¬ 
matography  suggests  that  the  human  Sfh-His  fusion  protein  is 
a  tetramer,  in  contrast  to  E.  coli  Orn,  which  exists  as  a  ho¬ 
modimer  (6),  suggesting  that  these  proteins  may  maintain 
differing  activities  as  well. 

The  nuclease  activities  of  both  Sfn  and  Orn  (19)  are  inversely 
proportional  to  the  length  of  the  single  stranded  substrate.  Fur¬ 
thermore,  previous  kinetic  data  of  E.  coli  Orn  has  shown  that  the 
Km  values  for  short  and  long  single  stranded  RNAs  are  similar 
(19).  We  observed  that  Sfn  has  a  similar  Km  for  short  single 
stranded  RNAs  and  DNAs,  but  that  such  short  RNAs  are  de¬ 
graded  ~4-fold  more  efficiently  than  DNAs.  Such  experimental 
observations  raise  questions  of  how  the  substrate  length  or  the 
nucleic  acid  chemistry  influences  Sfn/Om  enzymatic  activity.  To 
answer  these  questions  will  require  determining  which  step(s)  is 
influenced  by  nucleotide  length  or  nucleic  add  composition.  High 
resolution  structural  data  of  Sfn,  alone  and  in  complex  with  RNA 
and  DNA  would  also  shed  light  on  the  recognition  and  catalytic 
mechanisms  of  these  proteins.  Lastly,  our  studies  emphasize  that 
future  experiments  should  pay  particular  attention  to  the  poten¬ 
tial  range  of  substrate  diversity  recognized  by  the  other  3'  to  5' 
exonuclease  superfamily  members. 
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SUMMARY 


The  human  HEM45/ISG20  gene  is  directly  regulated  by 
estrogen  hormone  and  interferon.  Amino  acid  sequence  comparison 
suggests  that  this  gene  encodes  a  protein  with  homology  to  the  3’  to 
5’  exonuclease  superfamily  that  includes  RNases  (such  as  RNase  T 
and  D),  the  proofreading  domains  of  DNA  polymerase  I  family.  We 
provided  here  direct  biochemical  evidence  that  HEM45/ISG20  gene 
product  acts  as  a  3’  to  5’  exonuclease  in  vitro.  This  protein 
degrades  RNA  and  single  stranded  DNA  with  ~35  fold  less  efficiency. 
Hem45/Isg20  prefers  Mn2+  as  metal  cofactor  and  displays  pH  optima 
of  ~7.0.  Mutation  of  a  conserved  aspartate  (D94)  to  glycine  reduces 
both  nuclease  activities  of  Hem45/Isg20.  In  addition  to  RNase  L, 
Hem45/Isg20  is  the  second  known  interferon-regulated  RNase. 
Hem45/Isg20  is  located  in  PML  nuclear  bodies  which  are  known 
sites  of  transcription  (and  replication)  of  not  only  oncogenic  DNA 
viruses  but  also  cellular  hormonal-dependent  RNA  polymerase  II 
transcription.  We  proposed  that  the  biological  role  of  Hem45/Isg20 
is  to  degrade  not  only  viral  RNAs  as  part  of  the  cellular  interferon 
anti-viral  response,  but  also  estrogen  regulated  cellular  mRNAs  as 
part  of  the  down  regulation  of  estrogen  signaling  process. 
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INTRODUCTION 


HEM45  (HeLa  Estrogen  Modulated,  band  45)  gene,  also  called 
ISG20  (Interferon  Stimulated  Gene  product  of  20  kDa),  is 
ubiquitously  expressed  as  a  ~0.7  kb  transcript  at  low  level  in 
numerous  immortal  human  cell  lines,  in  various  human  tissues,  and 
also  in  rat  immature  uterus  (1,  2).  Transcription  of  the 
HEM45/ISG20  gene  is  directly  regulated  by  estrogen  hormone  (2). 
Estrogen  hormones  cause  HEM45/ISG20  mRNA  level  to  increase  -20- 
fold  in  human  HeLa  cells  expressing  the  estrogen  receptor  alpha 
gene  and  to  increase  maximally  8-fold  in  rat  uterus  six  hours  after 
estrogen  treatment  (2).  Estrogens  are  important  in  normal  uterine 
development  (2). 

The  HEM45/ISG20  gene  is  also  induced  directly  by  interferons 
at  the  transcription  level  (1).  Interferons  are  a  family  of  secreted 
cellular  proteins  that  are  involved  in  the  regulation  of  cell 
proliferation,  cellular  differentiation,  immune  response,  and 
antiviral  response  (reviewed  in  3).  The  HEM45/ISG20  gene  has  been 
shown  to  localize  to  chromosome  position  15q26  by  in  situ 
hybridization  (4),  while  the  Hem45/Isg20  proteins,  like  some  other 
interferon-induced  proteins,  are  located  in  spherical  nuclear  bodies 
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termed  promyoletic  (PML)  oncogenic  domains  (PODs)  [1].  PODs 
(also  known  as  nuclear  domain  10  or  Kr  bodies)  are  distinct  from 
other  subnuclear  domains  such  as  nucleolus  and  splice  sites  (5). 
PODs  are  composed  of  large  multiprotein  complexes  associated  with 
the  nuclear  matrix  (5).  They  represent  preferential  targets  for  viral 
infection  (6)  and  interferon  action,  and  their  substructure  integrity 
are  compromised  in  certain  human  diseases  including  leukemia  and 
viral  infections  (6). 

HEM45/ISG20  shows  amino  acid  sequence  homology  to  the 
carboxyl  terminus  of  XPMC2  gene  of  human  and  of  Xenopus  laevis 
(1,  7).  XPMC2  appears  to  be  a  negative  regulator  of  cell  division 
involving  in  the  control  mechanism  that  prevents  the  onset  of 
mitosis  until  DNA  replication  is  completed.  XPMC2  is  able  to 
prevent  mitotic  catastrophe  in  Schizosaccharomyces  pombe  strain 
that  lack  inhibitors  of  the  G2  to  M  transition  of  the  cell  cycle  (8). 
Xpmc2  is  localized  to  the  nucleus  of  Xenopus  eggs  and  was  found  to 
be  a  substrate  for  MPF  (8). 

The  HEM45/ISG20  gene  was  recently  placed  into  the  3'  to  5" 
exonuclease  superfamily  (9)  because  it  contains  conserved 
exonuclease  motifs.  Homology  within  the  superfamily  is  centered 
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around  three  conserved  exonuclease  motifs  termed  Exo  I,  II  and  III 


(10).  The  crystal  structure  of  the  Klenow  fragment  of  Escherichia 
coli  DNA  polymerase  I  and  complementing  site-directed  mutagenesis 
studies  indicate  that  the  Exo  I,  II,  and  III  motifs  are  clustered  around 
the  active  site  and  contain  four  conserved  negatively  charged 
residues  that  are  critical  for  coordinating  the  two  metal  ions 
involved  in  phosphodiester  bond  cleavage  (reviewed  in  11).  This 
superfamily  includes  RNases  (such  as  RNaseT  and  D),  the 
proofreading  domains  of  pol  I-family  DNA  polymerases,  and  DNases 
that  exist  as  independent  proteins  (such  as  Escherichia  coli  Exol)  or 
as  domains  within  larger  polypeptides  (such  as  a  region  within  the 
Werner's  syndrome  helicase)  [9,  10,  11]. 

We  reported  here  the  biochemical  evidence  that  Hem45/Isg20 
is  a  3’  to  5’  exonuclease  with  a  strong  preference  for  RNAs  over 
single  stranded  DNAs.  We  discussed  the  biological  role 
Hem45/Isg20  may  play  in  the  cellular  interferon-dependent 
antiviral  response  and  estrogen  hormonal  signaling  process. 
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EXPERIMENTAL  PROCEDURES 


Buffers  and  Reagents- All  reagents  were  purchased  from  Sigma 
unless  otherwise  indicated.  Restriction  enzymes  were  purchased 
from  New  England  Biolabs.  Labeled  nucleotides  were  from 
Amersham.  Spectrophotometric-grade  glycerol  was  obtained  from 
Fisher.  pETs  vectors  were  from  Novagen  (Madison,  WI).  DNA  oligos 
were  obtained  from  Operon  Biotechnologies  (Alameda,  CA). 

Synthetic  RNA  was  obtained  from  Dharmacon  Research  (Boulder, 
CO).  L  buffer  (Lysis  buffer)  contains  50  mM  Tris-HCl  pH  7.0,  50  mM 
NaCl,  20%  glycerol,  20  mM  p-mercaptoethanol,  and  1  mM 
phenylmethylsulfonylflouride  (PMSF).  W5  buffer  is  50  mM  Tris-HCl 
pH  7.9,  500  mM  NaCl,  20%  glycerol,  5  mM  imidazole,  10  mM  (3- 
mercaptoethanol,  and  1  mM  PMSF.  W20  is  50  mM  Tris-HCl  pH  7.9, 
500  mM  NaCl,  20%  glycerol,  20  mM  imidazole,  10  mM  P- 
mercaptoethanol,  and  1  mM  PMSF.  E250  buffer  is  50  mM  Tris-HCl 
pH  7.9,  250  mM  NaCl,  20%  glycerol,  250  mM  imidazole,  10  mM  P- 
mercaptoethanol,  and  1  mM  PMSF.  All  pH  values  were  determined 
at  21°C. 

Purification  of  recombinant  Hem45/Isg20-His  protein-The 
HEM45/ISG20  coding  region  was  PCR  amplified  using  primers 
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NC05HEM45  (5  "-GC ATGC AGA ATTCGGCACGAGCTCTGA-3  ’ )  and 
HIND3  'HEM45  (5  '-GC  AGTAAGCTTTCAGTCTGAC  ACAGCCAG-3  ’ ), 
partially  digested  with  Ncol  and  Hindlll,  and  sub-cloned  after 
digestion  into  the  Ncol  and  Hindlll  sites  of  pET28d  (Novagen, 
Madison,  WI)  to  generate  pHem45/Isg20-His.  This  construct  allows 
for  expression  of  six  histidine  residues  on  the  carboxyl  terminus  of 
the  Hem45/Isg20  protein  under  the  control  of  a  T7  RNA  polymerase 
promoter.  pHem45/Isg20-His  plasmid  was  sequenced  (12)  and  no 
PCR  errors  were  found. 

pHem45/Isg20D94G  plasmid  contains  the  HEM45/ISG20  gene 
with  D94G  mutation  in  the  ExoII  exonuclease  motif  (9).  A  145  bp 
PstI  fragment  containing  this  mutation  was  gel  isolated  from  a  PstI 
digest  of  pGST-ISG20ExoII  (obtained  from  Dr.  Mechti)  and  subcloned 
into  the  internal  PstI  sites  within  the  HEM45/ISG20  protein  coding 
region  of  pHem45/Isg20-His  plasmid. 

The  pHem45/Isg20-His  or  pHem45/Isg20D94G  plasmid  was 
transformed  into  BL21(DE3)/pLysS  E.coli  strain  (Novagen).  An 
overnight  culture  of  100  ml  was  grown  at  37°C  in  LB  (1%  bacto- 
tryptone,  0.5%  bacto-yeast  extract,  and  1%  NaCl)  with  50  pg/ml 
kanamycin  and  25  pg/ml  chloramphenicol.  The  overnight  culture 
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was  used  to  inoculate  2  liter  of  the  same  medium.  This  culture  was 


grown  at  37°C  with  vigorous  aeration  until  the  A550nm  was  ~0.6. 
Isopropyl- 1-thio-p-galactopyranoside  (final  concentration  of  1  mM) 
was  then  added  to  induce  Hem45/Isg20-His  protein  expression. 

After  4  hours  of  induction,  cells  were  harvested,  and  cell  pellets 
were  frozen  at  -20°C.  pHem45/Isg20-His  plasmid,  but  not 
pHem45D94G  plasmid,  causes  -2-3  fold  decrease  of  growth  rate  of 
BL21(DE3)  cells  even  in  the  presence  of  plysS  plasmid  at  the 
uninduced  state  (data  not  shown),  suggesting  a  role  for  nuclease 
activity  in  growth  efficiency.  The  cell  pellet  was  resuspended  in  40 
ml  of  L  buffer.  The  cell  suspension  was  sonicated  using  a  Misonix  XL 
sonicator  by  two  1  minute  bursts  with  a  microtip  at  the  maximum 
setting.  The  cell  lysate  was  centrifuged  at  27,000g  for  30  min  at 
4°C.  To  the  supernatant,  polyethyleneimine  (PEI)  was  added  slowly 
with  constant  stirring  to  a  final  concentration  of  0.15%  to  remove 
nucleic  acids  (13).  The  suspension  was  centrifuged  and  to  the 
supernatant,  NaCl  was  added  to  a  final  concentration  of  500  m  M 
and  imidazole  was  added  to  a  final  concentration  of  5  mM.  The 
eluant  was  then  incubated  with  2  ml  of  Ni-NTA  resin  (Qiagen,  Santa 
Clarita,  CA)  for  1  hr  at  4°C  with  gentle  rocking.  This  suspension  was 
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centrifuged,  and  the  resin  was  washed  2  times  each  with  20  ml  W5 
buffer,  followed  by  2  washes  each  with  20  ml  of  W20  buffer. 
Hem45/Isg20-His  proteins  were  then  eluted  from  the  Ni-NTA  resin  4 
times,  each  with  2  ml  of  E250  buffer,  dialyzed  overnight  against  L 
buffer  containing  0.1  mM  DTT  and  50%  glycerol,  and  stored  at 
-20°C.  Concentrations  of  protein  solutions  were  determined  by 
measuring  the  absorbance  at  280nm  and  using  the  theoretical  molar 
extinction  coefficient  E28onm  of  13,325  M'cnr1  calculated  for 
Hem45/Isg20-His  (14).  From  2  liter  of  induced  bacterial  culture,  we 
obtained  ~1  mg  of  Hem45/Isg20-His  fusion  protein  (Figure  1). 

Nuclease  assays-  Various  DNA  and  RNA5  substrates  were 
labeled  at  the  5’  end  by  T4  polynucleotide  kinase  with  y-P32-ATP 
(15).  RNA5  is  5’-GAUCG-3\  DNA5  is  5’-GATCG-3\  Uniformly 
labeled  RNA  substrates  used  in  Figure  3  were  synthesized  with  T7 
RNA  Polymerase  and  a-P32-GTP  according  to  the  instructions  of  the 
manufacturer  (Gibco-BRL).  Supercoiled  pET-28d  was  used  as  the 
promoter  DNA  template  to  produce  343  base  long  RNA  with  a  15 
base  stem  and  6  base  loop  structure  (TO  transcription  terminator)  at 
the  3’  end  of  the  RNA  (16).  For  the  210  base  long  RNA,  pET-28d 
plasmid  that  has  been  linearized  with  Xhol  was  used  as  the  T7 
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promoter  template,  with  the  Xhol  restriction  site  is  located  between 
the  T7  promoter  and  the  TO  transcription  terminator. 

Labeled  nucleic  acid  substrate  (200-400  fmol)  was  incubated 
with  40-1000  fmol  of  Hem45/Isg20-His  protein  at  37°C  in  10  pil  of 
50  mM  HEPES-KOH  pH  7.0,  10%  glycerol,  50  mM  NaCl,  1  mM  MnCl2, 
0.01%  Triton  X-100,  and  1  mM  DTT.  The  reactions  were  stopped 
with  10  pi  of  80%  formamide  dye  solution,  heated  at  80°C  for  3  min, 
and  then  fractionated  on  a  20%  or  a  22.5%  8  M  urea  polyacrylamide 
gel.  Visualization  of  the  labeled  substrate  on  the  gels  was  also 
achieved  using  a  Molecular  Dynamics  (Sunnyvale,  CA)  STORM  860 
Phosphorimager  and  quantitative  analysis  was  performed  using 
Molecular  Dynamics  ImageQuant  vl.ll  software.  For  the  pH 
titrations,  different  pHs  were  maintained  with  a  buffer  mixture  of 
constant  ionic  strength  composed  of  25  mM  of  acetic  acids  and  MES, 
and  50  mM  of  Trizma  base,  instead  of  25  mM  HEPES-KOH  pH  7.0 
(17).  The  specific  activity  is  defined  as  lU/pg  of  Hem45/Isg20-His. 
1U  is  lpmol  of  RNA5  or  DNA5  substrate  being  degraded  to 
mononucleotide  form  in  1  minute  at  37°C. 
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RESULTS 


Hem45/Isg20-His  is  a  3’  to  5’  exonuclease-Tht  HEM45/ISG20  cDNA 
codes  for  a  181  amino  acid  protein  of  20,363  daltons  with  a 
theoretical  pi  of  9.5.  Hem45/Isg20  belongs  to  the  3’  to  5’ 
exonuclease  superfamily  (9)  based  on  conservation  of  the  three 
characteristic  sequence  motifs  Exol,  II,  and  III  (10,  11). 

To  examine  experimentally  for  nuclease  activity, 
HEM45/ISG20  gene  was  subcloned  into  the  pET28d  plasmid  to 
produce  a  C-terminal-tagged  Hem45/Isg20-His  fusion  protein,  and 
the  recombinant  protein  was  purified  as  described  in  Experimental 
Procedures  (Figure  1). 

As  shown  in  Figure  2,  Hem45/Isg20-His  is  capable  of 
degrading  RNA5,  DNA5,  and  the  single  stranded  DNA  flap  of  the  3’ 
flap  DNA  substrate,  stopping  at  the  double  stranded  DNA  junction. 
Also,  Hem45/Isg20-His  is  -35-fold  more  active  on  single  stranded 
RNA  than  DNA  (Figure  2),  with  a  specific  activity  for  RNA5  of  4.21  ± 
0.80  U/pg,  and  for  DNA5  substrate  of  0.12  ±  0.01  U/pg.  We 
observed  degraded  4  mer  products  prior  to  detecting 
mononucleotide  products  in  a  time  course  with  single  stranded 
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RNA5  or  DNA5  substrate  (Figure  2),  consistent  with  Hem45/Isg20- 


His  as  a  3’  to  5’  exonuclease. 

We  also  examined  Hem45/Isg20-His  RNase  activity  on  longer 
RNA  substrate  with  or  without  a  stem-loop  structure  at  the  3’  end. 

As  shown  in  Figure  3,  Hem45/Isg20-His  is  capable  of  degrading  not 
only  short  RNA  such  as  RNA5,  but  also  210  base  long  RNA. 

Flowever,  the  presence  of  a  stem-loop  structure  at  the  3’  end  of  the 
RNA  substrate  cause  a  ~40  fold  reduction  in  the  RNase  activity  of 
Hem45/Isg20-Flis  at  the  30  or  60  time  point  (Figure  3).  Such  data  is 
consistent  with  the  above  time  course  data  indicating  that 
Hem45/Isg20-His  is  a  3’  to  5’  exonuclease.  The  data  also  suggest 
that  Hem45/Isg20-His  exonuclease  activity  on  RNA  is  processive  and 
not  distributive  since,  in  a  time  course,  we  did  not  detect  many  of 
intermediate  size  products  between  the  starting  RNA  substrate  and 
the  final  mononucleotide  products  (Figure  3). 

We  subsequently  examined  Hem45/Isg20  nuclease  activity  on 
other  DNA  structures  known  to  be  products/substrates  of  DNA 
repair  processes.  The  various  double  stranded  DNA  substrates  we 
used  are  described  in  Ref.  18.  We  did  not  detect  any  protein- 
dependent  degradation  of  42  bp  double-stranded  DNAs  of  gapped, 
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nicked,  or  flushed  structures  (data  not  shown).  Neither  did  we 
detect  any  degradation  of  5’  flap  double  stranded  DNA  structure 
(data  not  shown).  However,  we  did  detect  degradation  of  the  single 
stranded  DNA  region  of  a  3’  overhang  DNA  structure  (data  not 
shown).  Such  data  is  consistent  with  the  results  of  Figure  2, 
indicating  that  Hem45/Isg20-His  degrades  single  stranded  RNA  and 
DNA,  but  not  double  stranded  DNA  structures. 

To  confirm  that  the  observed  DNase  and  RNase  activities  are 
intrinsic  to  the  Hem45/Isg20-His  protein,  a  catalytically  inactive 
Hem45/Isg20-His  mutant  was  constructed.  Amino  acid  residue 
Aspartate94  was  selected  because  an  aspartate  to  alanine  mutation  at 
the  structurally  equivalent  residue  in  Escherichia  coli  DNA  poll 
causes  a  large  decrease  in  3’  to  5’  exonuclease  activity  (18).  The 
Hem45/Isg20D94G  mutant  was  purified  using  the  same  procedure 
as  described  for  wild  type  Hem45/Isg20-His  protein  (Figure  1). 

When  compared  to  wild  type  Hem45/Isg20-His,  the 
Hem45/Isg20D94G  mutant  displayed  a  -90%  reduction  in  degrading 
both  RNA5  and  DNA5  substrates  (Figure  4).  These  data  are 
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consistent  with  both  nuclease  activities  being  intrinsic  to  the 
Hem45/Isg20-His  protein,  and  not  being  the  result  of  contaminants. 
Optimization  of  the  exonuclease  activity  of  Hem45/Isg20-His  fusion 
protein- Using  a  5’  end  labeled  RNA5  as  the  substrate,  we  determined 
the  divalent  metal  cation  preference  of  human  Hem45/Isg20-His 
protein.  At  metal  concentrations  of  0.05,  0.5,  and  5  mM,  only  Mn2+ 
had  any  stimulatory  effect  on  Hem45/Isg20-His  exoribonuclease 
activity,  while  in  the  presence  Mg2+,  Ca2+,  Cu2+,  Zn2+,  and  Ni2+,  there 
was  no  detectable  degradation  of  the  RNA5  substrate  (Table  1).  The 
DNase  activity,  as  determined  with  the  3’  flap  DNA  substrate,  also 
showed  the  same  metal  preference  (data  not  shown).  In  addition, 
both  the  RNase  and  DNase  activities  of  Hem45/Isg20-His  show  the 
same  pH  profile,  with  the  optimal  pH  of  7.0  (Figure  5  and  data  not 
shown). 
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DISCUSSION 


The  human  HEM45/ISG20  gene  is  categorized  as  a  member  of 
the  3’  to  5’  exonuclease  superfamily  (9,  10,  11)  based  on  amino  acid 
sequence  comparisons.  We  presented  here  direct  biochemical 
evidence  that  Hem45/Isg20  is  a  3’  to  5’  exonuclease  with  strong 
preference  for  single  stranded  RNA  over  single  stranded  DNA.  Our 
studies  here,  along  with  the  studies  on  RNase  T  and  Orn  proteins 
(20,  21),  emphasize  that  future  experiments  should  pay  particular 
attention  to  the  potential  range  of  substrate  diversity  recognized  by 
the  other  3’  to  5’  exonuclease  superfamily  members. 

Some  of  the  members  of  this  superfamily,  most  notably  the 
Werner  syndrome  gene  product  (WRN)  and  the  polymyositis- 
scleroderma  overlap  syndrome  100  kDa  autoantigen  (PM-Scl  100), 
are  associated  with  human  disease  (9).  While  a  connection  of 
Hem45/Isg20  to  a  specific  human  disease  is  not  known,  RNases  are 
now  known  to  control  biological  processes  ranging  from  splicing  to 
organogenesis  (reviewed  in  22).  Growth  regulatory  molecules  such 
as  angiogenin,  neurotoxins,  and  plant  allergens  have  RNase  activity 
or  significant  homology  to  known  RNases  (reviewed  in  22). 
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While  the  biological  significance  of  the  weak  DNase  activity  of 
Hem45/Isg20  is  unclear,  its  RNase  activity  may  play  a  role  in  the 
cellular  interferon  antiviral  response.  HEM45/ISG20  is  expressed  at 
low  level  in  all  cell  types  studied  so  far  (1,  2)  and  is  up  regulated  at 
the  transcription  level  by  interferons  (1).  Transient  transfection 
experiments  indicated  that  Hem45/Isg20  is  located  in  the  PML 
nuclear  bodies  (PODs)  near  the  nuclear  matrix  (1).  The  PODs  are 
distinct  from  other  known  nuclear  substructures  such  as  nucleolus 
and  splice  sites.  Such  PODs  are  known  sites  of  transcription  and 
replication  of  oncogenic  DNA  viruses  such  as  Simian  Virus  SV40, 
Adenovirus,  and  Herpes  Simplex  Virus  (6).  These  experimental 
observations,  taken  together,  suggest  that  Hem45/Isg20  is  part  of 
the  cellular  interferon  defense  mechanism  against  viral  infection. 

We  proposed  that  Hem45/Isg20,  given  its  biochemical  activity  of 
degrading  RNAs,  is  involved  in  removing  viral  mRNAs  to  prevent 
viral  gene  expression  and  thereby  viral  propagation. 

Hem45/Isg20  is  the  second  known  RNase  regulated  by 
interferon.  RNase  L,  the  first  interferon-dependent  RNase,  is  also 
constitutively  expressed  at  a  low  level  in  various  cell  lines  like 
Hem45/Isg20  (reviewed  in  3).  However,  in  contrast  to 
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Hem45/Isg20,  RNase  L  is  active  only  in  the  presence  of  its  ligand 
effector,  2’5’linked  poly-A  (reviewed  in  3).  It  is  conceivable  that 
there  may  be  other  protein  or  ligand  inhibitors  that  can  complex 
with  Hem45/Isg20  in  order  to  regulate  its  RNase  activity  since  it  is 
thought  that  RNases  are  typically  present  in  minute  quantities 
bound  with  inhibitors  or  lying  latent  in  cells  awaiting  an  activator 
(reviewed  in  22).  Alternatively,  Hem45/Isg20  activity  can  be 
regulated  by  post-translational  modification.  However,  we  suggest 
that  Hem45/Isg20  RNase  activity  is  probably  regulated  by  localizing 
it  to  a  particular  region  of  the  cell  such  as  PODs  since,  in  transient 
transfection  experiments,  overexpressed  Hem45/Isg20  are  localized 
to  the  PODs  and  such  overexpression  did  not  appear  to  affect  cell 
growth  (1). 

The  estrogen  hormone,  like  interferon,  is  also  capable  of 
transcriptionally  up  regulated  the  HEM45/ISG20  gene  (2).  Recent 
demonstrations  of  nascent  RNA  polymerase  II  transcripts  and  CBP 
(CREB  binding  proteins)  transcription  coactivators  in  PODs  suggest  a 
role  for  PODs  in  transcriptional  regulation  (23).  Specifically,  at  least 
one  POD  associated  protein,  PML,  has  been  shown  to  interact  with 
CBP  and  dramatically  stimulated  hormonal  nuclear  receptor  (such 
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as  retinoid  X  receptor  and  glucocorticoid  receptor)  transcriptional 
activity  (24).  The  hormonal  receptors,  when  complexed  with  their 
respective  hormone  ligand,  act  as  transcription  factors.  These 
results  raise  the  possibility  that  PML  may  also  stimulate  estrogen 

nuclear  receptor  transcriptional  activity.  If  this  is  the  case,  then  it  is 

conceivable  that  the  function  of  Hem45/Isg20,  induced  by  estrogen 

and  localized  to  the  PODs,  is  to  down-regulate  the  estrogen- 

dependent  transcriptional  response  by  degrading  mRNAs  induced 
by  estrogen.  Consistent  with  this  hypothesis,  the  increase  of 
HEM45/ISG20  mRNA  level  in  immature  uterus  in  the  presence  of 
estrogen  occurred  later  than  those  of  estrogen  early  response  genes 
such  as  c-jun,  c-fos,  and  zif-268  (2).  c-jun,  c-fos,  and  zif-268  are 
transcription  factors  that  are  thought  to  mediate  the  early  stage  of 
the  uterine  response  to  estrogens  (2).  Future  experiments  will  be 
needed  to  test  these  hypotheses  about  the  precise  role  of 
Hem45/Isg20  in  the  cellular  interferon-dependent  antiviral 
response  and  estrogen  hormonal  signaling  pathway. 
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FIGURE  LEGENDS 


Figure  1.  Purity  of  Hem45/Isg20-His  and  Hem45/Isg20ExoII 
proteins.  A  15%  polyacrylamide  SDS  gel  containing  fractions  of 
the  various  purification  steps.  Lanes  1-5  are  Hem45/Isg20-His;  lanes 
6-10,  Hem45/Isg20D94G;  and  lane  11,  protein  markers.  Lanes  1 
and  10,  whole  cell  extract;  lanes  2  and  9,  whole  cell  supernatant; 
lanes  3  and  8,  0.15%  PEI  supernatant;  lanes  4  and  7,  flow-through  of 
the  Ni-affinity  chromatography;  and  lanes  5  and  6,  Ni-affinity 
purified  fraction.  Protein  marker  sizes  in  kDa  are  indicated  on  the 
right.  The  gel  was  stained  with  Coomassie  Blue  R250. 

Figure  2.  Time  course  of  Hem45/Isg20-His  exonuclease 
activity.  Hem45/Isg20-His  degrades  single  stranded  RNAs  and 
DNAs.  All  substrates  were  labeled  at  the  5?  end.  40  nM  of  RNA5  or 
DNA  substrates  (DNA5  or  3’  flap)  was  incubated  with  20  nM  or  200 
nM  of  Hem45/Isg20-His  protein,  respectively,  for  various  times  at 
3  7°C.  RNA5  or  DNA5  was  labeled  at  the  5’  end.  The  top  DNA  strand 
containing  the  flap  was  labeled  at  the  5’  end  also.  Lanes  1-6  are 
reactions  with  RNA5;  lanes  7-12,  DNA5;  lanes  13-18,  3’  flap  DNA 
substrate  of  42  mer  with  10  nucleotide  long  flap  at  the  3’  end. 

Lanes  1,  7,  and  13  are  the  no  protein  controls.  Lanes  2,  8,  and  14,  2 
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min  incubations  with  Hem45/Isg20-His;  lanes  3,  9,  and  15,  4  min 
incubations;  lanes  4,  10  and  16,  8  min  incubations;  lanes  5,  11,  17, 
16  min  incubations;  and  lanes  6,  12,  and  18,  32  min  incubations. 
Figure  3.  Hem45/Isg20-His  nuclease  activity  on  long  RNA 
substrates  with  or  without  a  stem-loop  structure  at  the  3’ 
end.  Lanes  1-4  are  reactions  with  uniformly  labeled  343  base  long 
RNA  with  the  TO  transcription  terminator  stem-loop  structure  at  the 
3’  end;  lanes  5-8,  uniformly  labeled  210  base  long  RNA;  and  lanes  9- 
12,  RNA5  labeled  at  the  5’  end.  Lanes  1,  5,  and  9  are  the  no  protein 
controls.  All  other  lanes  are  reactions  with  10  nM  of  Hem45/Isg20- 
His.  Lanes  2,  6,  and  10  are  30  min  reactions  at  37°C;  lanes  3,  7,  and 
11,  60  min  reactions;  and  lanes  4,  8  and  12,  90  min  reactions.  On 
the  right  of  the  figure,  band  A  is  unincorporated  a-P32-GTP;  band  B, 
Hem45/Isg20-His  degraded  mononucleotide  products;  band  C,  RNA5 
substrate;  band  D,  210  base  long  RNA;  and  band  E,  343  base  RNA 
with  a  3’  end  stem-loop  structure.  Signals  that  are  located 
immediately  on  top  of  band  B  in  lanes  5-8  are  probably  abortive 
initiation  RNA  products  from  linearized  DNA  promoter  template  but 
not  from  supercoiled  promoter  template  (25).  Band  E  signal  in 
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lanes  5-8  is  due  to  incomplete  linearization  of  the  promoter 
template  by  Xhol. 

Figure  4.  The  effect  of  a  D94G  mutation  on  the  DNase  and 
RNase  activities  of  Hem45/Isg20-His.  Exonuclease  activity  of 
Hem45/Isg20D94G  mutant  with  RNA5  and  3’  flap  DNA  substrate 
described  in  Figure  2.  Lanes  1-5  are  with  RNA5,  and  lanes  6-10  are 
with  3’  flap  substrates.  Lane  11  is  DNA  markers  of  42  mer  and  21 
mer.  The  substrate  concentration  is  40  nM.  Lane  1  is  RNA5 
undigested  control;  protein  control;  lane  2  and  3,  5  and  15  nM  of 
wild  type  Hem45/Isg20-His  (WT);  lane  4  and  5,  5  and  15  nM  of 
Hem45/Isg20D94G  mutant  (D94G  mutant);  lane  6,  3’  flap  DNA 
undigested  control;  lanes  7  and  8,  50  and  150  nM  of  WT  enzyme; 
and  lanes  9  and  10,  50  and  150  nM  of  D94G  mutant  enzyme.  The 
reactions  were  incubated  at  37°C  for  20  min.  Shown  is  a 
representative  of  three  independent  experiments. 

Figure  5.  pH  optima  of  Hem45/Isg20-His  exonuclease 
activity.  40  nM  of  5 ’-labeled  RNA5  was  incubated  with  2  nM  of 
Hem45/Isg20-His  for  10  minutes  37°C  at  various  pHs.  pH  was 
maintained  with  a  mixture  of  25  mM  acetic  acid,  25  mM  of  MES,  and 
50  mM  of  TrisHCl  (16).  The  x-axis  is  the  pH.  The  y-axis  is  the 
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relative  exonuclease  activity.  Maximal  activity  at  pH  7  is  arbitrarily 
designated  as  1.  Exonuclease  activity  is  proportional  to  the  amount 
of  labeled  RNA5  substrate  that  has  been  converted  to  the  final 
mononucleotide  product.  Shown  is  a  representative  of  three 
independent  experiments. 
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Table  1.  Metal  cofactor  requirement  of  Hem45/Isg20-Hisa. 


Metal 

Concentration 

Relative 

(mM) 

Activity 

MnCl2 

5 

0.85 

0.5 

1 

0.05 

0.15 

MgCl2 

5,0.5,  0.05 

<0.01 

CaCl2 

5,0.5,  0.05 

<0.01 

CuCl2 

5,0.5,  0.05 

<0.01 

ZnCl2 

5,0.5,  0.05 

<0.01 

NiCl2 

5,0.5,  0.05 

<0.01 

a40  nM  of  labeled  RNA5  substrate  was  incubated  with  2  nM  of 
Hem45/Isg20-His  enzyme  for  10  min  at  37°C  with  different 
concentrations  of  various  divalent  metals  shown.  Mononucleotide 
products  were  then  quantitated  by  Phosphorimager  scan  and  used 
to  represent  RNase  activity.  The  highest  RNase  activity  was 
arbitrarily  designated  as  1.  Shown  is  a  representative  of  three 
independent  experiments. 
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Figure  5 


